Bayesian inference of phylogenetic networks from bi-allelic genetic markers
نویسندگان
چکیده
Phylogenetic networks are rooted, directed, acyclic graphs that model reticulate evolutionary histories. Recently, statistical methods were devised for inferring such networks from either gene tree estimates or the sequence alignments of multiple unlinked loci. Bi-allelic markers, most notably single nucleotide polymorphisms (SNPs) and amplified fragment length polymorphisms (AFLPs), provide a powerful source of genome-wide data. In a recent paper, a method called SNAPP was introduced for statistical inference of species trees from unlinked bi-allelic markers. The generative process assumed by the method combined both a model of evolution for the bi-allelic markers, as well as the multispecies coalescent. A novel component of the method was a polynomial-time algorithm for exact computation of the likelihood of a fixed species tree via integration over all possible gene trees for a given marker. Here we report on a method for Bayesian inference of phylogenetic networks from bi-allelic markers. Our method significantly extends the algorithm for exact computation of phylogenetic network likelihood via integration over all possible gene trees. Unlike the case of species trees, the algorithm is no longer polynomial-time on all instances of phylogenetic networks. Furthermore, the method utilizes a reversible-jump MCMC technique to sample the posterior of phylogenetic networks given bi-allelic marker data. Our method has a very good performance in terms of accuracy and robustness as we demonstrate on simulated data, as well as a data set of multiple New Zealand species of the plant genus Ourisia (Plantaginaceae). We implemented the method in the publicly available, open-source PhyloNet software package.
منابع مشابه
Accuracy of Genomic Prediction under Different Genetic Architectures and Estimation Methods
The accuracy of genomic breeding value prediction was investigated in various levels of reference population size, trait heritability and the number of quantitative trait locus (QTL). Five Bayesian methods, including Bayesian Ridge regression, BayesA, BayesB, BayesC and Bayesian LASSO, were used to estimate the marker effects for each of 27 scenarios resulted from combining three levels for her...
متن کاملPatterns of allelic variation of polysomic SSR markers in population genetic assessment of Persian Sturgeon (Acipenser persicus Borodin, 1897) in Caspian Sea
Genetic structure of Acipenser persicus, in the Caspian Sea was studied using tetrasomic microsatellite markers. A total of 195 specimens of A. persicus breeders were collected from the sampling stations located in the five fishery catch zones as well as from the Sefidrud River in the south Caspian region. About 2 g of caudal fin samples was collected from each sturgeon specimen and preserved i...
متن کاملPatterns of allelic variation of polysomic SSR markers in population genetic assessment of Persian Sturgeon (Acipenser persicus Borodin, 1897) in Caspian Sea
Genetic structure of Acipenser persicus, in the Caspian Sea was studied using tetrasomic microsatellite markers. A total of 195 specimens of A. persicus breeders were collected from the sampling stations located in the five fishery catch zones as well as from the Sefidrud River in the south Caspian region. About 2 g of caudal fin samples was collected from each sturgeon specimen and preserved i...
متن کاملGenetic Differentiation of Draa Indigenous Breed and Relationships with Other Goat Populations Assessed by Microsatellite DNA Markers
Moroccan goats are characterized by the presence of different populations identified only based on their phenotypes. The objectives of this study were to assess the genetic differentiation of the Draa goat breed and to analyze its genetic structure and its relationships with other local populations using 12 microsatellite markers. The screening was done in South Eastern and Southern Morocco on ...
متن کاملBayesian Inference of (Co) Variance Components and Genetic Parameters for Economic Traits in Iranian Holsteins via Gibbs Sampling
The aim of this study was using Bayesian approach via Gibbs sampling (GS) for estimating genetic parameters of production, reproduction and health traits in Iranian Holstein cows. Data consisted of 320666 first- lactation records of Holstein cows from 7696 sires and 260302 dams collected by the animal breeding center of Iran from year 1991 to 2010. (Co) variance components were estimated using ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره 14 شماره
صفحات -
تاریخ انتشار 2018